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Question 1 
Intent of Question 
The primary goals of this question were to assess a student’s ability to (1) explain statistical terms used 
when describing the relationship between two variables: (2) interpret the slope of a linear regression 
equation; and (3) calculate a value of y when given a regression equation, a value of x, and a residual. 
Solution 


Part (a): 


In the context of a scatterplot in which y represents weight and x represents length, the following are 
defined. 


A positive relationship means that wolves with higher values of length also tend to have higher 
weights. 


A linear relationship means that as length increases by one meter, weight tends to change by a 
constant amount, on average. 





A strong relationship means that the data points fall close to a line (or curve). 
Part (b): 


The slope of 35.02 indicates that two wolves that differ by one meter in length are predicted to differ by 
35.02 kilograms in weight, with the longer wolf having the greater weight. 


Part (c): 
In general, a residual is equal to actual weight minus predicted weight, or equivalently, 
actual weight = predicted weight + residual. 
For the wolf with length 1.4 meters and residual of —9.67, the predicted weight is 
—16.46 + 35.02(1.4) = 32.568 kilograms. 


Therefore, the actual weight of the wolf is 32.568 + (—9.67) = 22.898 kilograms. 


Scoring 


Parts (a), (b), and (c) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
Part (a) is scored as follows: 


Essentially correct (E) if the response includes the following four components: 
1. A reasonable definition of positive 
2. Areasonable definition of linear 
3. Areasonable definition of strong 
4 Atleast one definition in context 
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Question 1 (continued) 
Partially correct (P) if the response includes only three of the four components. 
Incorrect (I) if the response does not meet the conditions for E or P. 


Notes: 

e ‘The description of a positive relationship should clearly indicate that relatively low values of 
one variable tend to appear with relatively low values of the other variable, and relatively high 
values of the first variable tend to appear with relatively high values of the other variable. 

o Examples of acceptable responses: 

» As length increases, so does weight. 

» Longer wolves weigh more. 

=» ‘The points on the graph go up as you move from left to right. 
o Examples of unacceptable responses: 

» As length goes up, weight changes. 

"= Both length and weight get bigger. 

=» ‘The correlation is greater than 0. 

e The description ofa linear relationship can take one of two approaches: the data pattern (data 
points exhibit the pattern of a line in the graph) or the constant rate of change (as the 
explanatory variable changes, the response variable exhibits a constant rate of change). 

o Examples of acceptable responses: 
» The points generally follow a straight line. 
» The relationship between x and y is straight. 
» Length and weight have a constant slope. 
o Examples of unacceptable responses: 
» The points all line up. 
» You can draw a straight line through the points. 
» There is a positive correlation. 
=» Every increase in x yields a 35.02 increase in y. 
e ‘The description of strong should indicate how close points are to a line. 
o Examples of acceptable responses: 
=" Observed values are close to predicted values. 
=» Deviations from the least-squares regression line are small. 
» ‘The correlation coefficient is close to 1. 
o Examples of unacceptable responses: 
» All the points are close together. 
» The scatterplots are clustered together. 
» ‘There is a high positive correlation. 

e Context can be shown by referring to length and weight or by using meters and kilograms. 

e Sketches and graphs can be used to help clarify definitions, but a sketch alone cannot satisfy a 
definition component. 


Part (b) is scored as follows: 


Essentially correct (E) if the response includes the following three components: 
1. ‘The correct value of 35.02 for the slope. 
2. An interpretation that includes an increase of a specified amount of weight for each unit 
increase in length. 
3. An indication that the relationship is not exact by using words such as “on average” or 
“predicted weight.” 
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Question 1 (continued) 


Partially correct (P) if the response includes only two of the three components. 


Note: If the response identifies the slope as —16.46 (the intercept value), the second component is 
satisfied only if the response states that for each one-meter increase in length there is a decrease 
in predicted or average weight of 16.46 kilograms. 


Incorrect (I) if the response does not meet the criteria for E or P. 
Part (c) is scored as follows: 


Essentially correct (E) if the response includes the following two components: 
1. A correct computation for the predicted value 32.568 kilograms. 
2. Acorrect computation for the actual weight 22.9 kilograms using the given residual and the 
predicted value. 


Partially correct (P) if the response provides a correct computation for the predicted value but is not 
able to complete the correct calculation of the actual weight, including if the residual is defined in the 
wrong direction as (predicted weight) — (actual weight) to give an answer of 42.24 kilograms: 

OR 
if the response provides an incorrect value for the predicted weight, but then uses that value correctly 


to determine the actual weight as (predicted weight) + residual = (predicted weight) +(—9.67) ; 


OR 
if the response provides a correct answer for the actual weight but does not give sufficient information 
to determine how it was calculated. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 
e ‘The expression —16.46 + 35.02(1.4) is enough to satisfy the first component. 
e The equation —16.46 + 35.02(1.4) — 9.67 = 22.9 satisfies both components. 


e Arithmetic mistakes are overlooked if they do not lead to an unreasonable answer (such as a 
negative value). For example, 32.568 + (—9.67) = 21.9 satisfies the second component. 
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Question 1 (continued) 

Complete Response 

Three parts essentially correct 
Substantial Response 

‘T'wo parts essentially correct and one part partially correct 
Developing Response 

Two parts essentially correct and no parts partially correct 
a One part essentially correct and one or two parts partially correct 
OR 

Three parts partially correct 
Minimal Response 

One part essentially correct 


OR 
No parts essentially correct and two parts partially correct 
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SECTION II 
Part A 
Questions 1-5 
Spend about 65 minutes on this part of the exam. 
Percent of Section II score—75 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


nT 
) ee 
1. Researchers studying a pack of gray wolves in North America collected data on the length x, in meters, from 


nose to tip of tail, and the weight y, in kilograms, of the wolves. A scatterplot of weight versus length revealed 
a relationship between the two variables described as positive, linear, and strong. 


(a) For the situation described above, explain what is meant by each of the following words. 
(i) Positive: 
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The data collected from the wolves were used to create the least-squares equation ¥ = -16.46 + 35.02x 


(b) Interpret the meaning of the Slope of the least-squares regression line in context. 
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STATISTICS 
SECTION II 
Part A 
Questions 1-5 
Spend about 65 minutes on this part of the exam. 
Percent of Section II score—75 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correciness of your methods as well as on the accuracy and completeness of your results and explanations. 


V/ 


1. Researchers studying a pack of gray wolves in North America collected data on the length x, in meters, from 
nose to tip of tail, and the weight y, in kilograms, of the wolves. A scatterplot of weight versus length revealed 
a relationship between the two variables described as positive, linear, and strong. , 
(a) For the situation described above, explain what is meant by each of the following words. 
(i) Positive: | 
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The data collected from the wolves were used to create the least-squares equation } = —16.46 + 35.02x. 
(b) Interpret the meaning of the slope of the least-squares regression line in context. 
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(c) One wolf in the pack with a length of 1.4 meters had a residual of —9.67 kilograms. What was the weight 
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SECTION II - 
Part A 
Questions 1-5 | 
Spend about 65 minutes on this part of the.exam. 
Percent of Section II score—75 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


I. Researchers studying a pack of gray wolves in North América collected data on the length x, in meters, fromi 


hose to tip of tail, and the weight y, in kilograms, of the wolves. A scatterplot of weight versus length revealed 
a relationship between the two variables described as positive, linear, and strong, eR? | 
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7 NY 
(i) Positive: re he / 





When Me data is plotted, Aneve 18 gn. evuarn st mK 
. eS ce: are oe: ye, i or? L.. . a? ee 3 
— aeend between He length and wer 2 Sa ode a 


Wolves : 


(ii) Linear: | | 7 | 

When ne data i plotted on the scatterplot, there iS. a 

Ctrond relah onshi a between the Lengin and weig h+ of dhe - 

wolucs wit ¢ close to) or =| (iw Fas case 

1S close to Ty err _ Ay ty ot ee 





(iii) Strong: ht Gaede Se oun Ps i qi 3 ee | 
When the cleta 1s plotted, the re Lahonship beeen. the 
length and in€ iC h+ ot +'Ne wolves. (¢ Shee ar wn ‘ay can 
be 


—* 
oe 


be scen wher CHW NG A \me of best AT dae data shou 
(e€ac +e regressron line (No oAlvers ) “e 


any part of this page is illegal. , GO ON TO THE NEXT PAGE. 
| ine 3% | : -6- 


© 2017 ‘The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


““hneeyy - 
why 
‘ 
} 


The data collected from t the wolves were used to create the least- -squares equation ye ~16. 46 + 35. 02x. 


(b): ims Be the meaning of the slope of the (isk t-squares regression line in context. 


Foc eve vy ‘tmerease of the length ot E the wolves, he predic! ed 
; weeght | increases: by im SO 


' (¢) One wolf i in the pack with a length of 1.4 meters had. a ‘residual of —9.67 kiloprams. What was the weight 
of the wolf? ae ae 


“Eesidual - = observed - “ ered dic red 
Sei ive 435 22¢ 8) 
Pe 1: (5 AEE 





Bea ae ‘observed 5 HES 
“bsened : Z 56.818 


oO 4 


The «seg ot the wolf rg 55.512 Kilog cams 


Unauthorized copying or reuse of 
any Part of this page is illegal. 


GO ON TO THE NEXT PAGE. 


© 2017 ‘The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


AP® STATISTICS 
2017 SCORING COMMENTARY 


Question 1 
Overview 


The primary goals of this question were to assess a student’s ability to (1) explain statistical terms used 
when describing the relationship between two variables; (2) interpret the slope of a linear regression 
equation; and (3) calculate a value of y when given a regression equation, a value of x, and a residual. 


Sample: 1A 
Score: 4 


Each of the two sentences in the response to part (a-i) provides a reasonable definition of a positive 
relationship. ‘They both indicate that the value of one variable tends to increase as the value of the other 
variable increases. The first sentence also satisfies the context requirement for part (a) because it 
describes the relationship in terms of length and weight. The first sentence of the response to part (a-ii) 
provides a reasonable definition of a linear relationship by indicating that the points on the scatterplot 
exhibit a straight line pattern. The second sentence provides context. The response to part (a-iii) provides a 
reasonable definition of the strength of a relationship. It indicates that a relationship is strong when the 
points on the scatterplot are close to a line with the phrase “residuals are generally fairly small.” The two 
graphs displayed in the response provide an alternative acceptable explanation because one graph illustrates 
a strong relationship, and the other illustrates a weaker relationship. Because reasonable definitions are given 
for all three components, and at least one is given in context, part (a) was scored as essentially correct. In 
part (b) the response provides an acceptable interpretation of the slope of the least-squares regression line by 
indicating that the “least squares regression line predicts that the weight in kilograms of a wolf would 
increase by approximately 35.02” for each 1-meter increase in length. It uses the correct value for the slope, 
and it clearly indicates that the least-squares regression line does not describe changes in the actual weights 
of wolves. The response is presented in the context of lengths and weights of wolves. Part (b) was scored as 
essentially correct. In part (c) the response provides a correct formula for a residual as a difference between 
an actual weight and a predicted weight. It displays the correct formula and reports the correct value for 
the predicted weight. It shows how to use the residual to compute the actual weight of the wolf from the 
predicted weight. Part (c) was scored as essentially correct. Because three parts were scored as essentially 
correct, the response earned a score of 4. 


Sample: 1B 
Score: 3 


The statement about “positive correlation” in the first sentence of the response to part (a-i) essentially 
defines a positive relationship as a positive relationship. It does not explain the meaning of a linear 
relationship. The sentence is extraneous and does not affect the scoring. The second sentence provides a 
reasonable definition of a positive relationship, and it is presented in the context of the wolf study. The first 
sentence of the response to part (a-ii) merely introduces the concept that the response addresses. The 
second and third sentences describe a positive relationship. ‘his was considered extraneous and not a 
parallel response. The fourth sentence describes a consequence of a straight line relationship but does not 
explain what is meant by a linear relationship. ‘This response is not a reasonable definition of a linear 
relationship, but it does satisfy the context requirement for part (a). The first sentence of the response to 
part (a-iii) is too vague to provide a reasonable definition of the strength of a relationship. The first part of the 
second sentence is a reasonable definition of strength because it indicates that the points on the scatterplot 
are close to a straight line, in this case, the least-squares regression line. The rest of the second sentence 
enhances the explanation. Because reasonable definitions are given for two of the three components, and at 
least one is given in context, part (a) was scored as partially correct. 
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Question 1 (continued) 


In part (b) the response correctly interprets the slope as a 35.02 kg increase in y for each 1-meter increase 

in x. The correct value of the slope is used, and the response clearly indicates that the slope reflects a change 
in a predicted response. The response is presented in context. Part (b) was scored as essentially correct. In 
part (c) the response provides a correct formula and value for the predicted weight. It displays a correct 
formula for the residual and correctly calculates the actual weight of the wolf. Consequently, part (c) was 
scored as essentially correct. Context is provided in the form of units of length and weight, but context is 
not needed for an essentially correct response to part (c). Because two parts were scored as essentially 
correct, and one part was scored as partially correct, the response earned a score of 3. 


Sample: 1C 
Score: 2 


The reference to “an upward-right trend” in the response to part (a-i) is not sufficiently precise to qualify as 
a reasonable definition of a positive relationship. The inclusion of the graph, however, provides the 
additional explanation needed for an acceptable response. This is a reasonable definition of a positive 
relationship, and it satisfies the context requirement for part (a). The response to part (a-ii) does not 
address the meaning of a linear relationship. Instead it uses a value of a correlation coefficient close to 1 or 
—1 to describe a strong relationship. Although it is not a reasonable definition of a linear relationship, the 
response does satisfy the context requirement for part (a). In the response to part (a-iii), indicating that the 
“data should be near the regression line” provides a reasonable definition of a strong relationship. ‘The 
response also satisfies the context requirement for part (a). Because reasonable definitions are given for two 
of the three components, and at least one is given in context, part (a) was scored as partially correct. The 
response to part (b) correctly identifies the value of the slope, but the interpretation is incorrect because it 
indicates that the predicted weight will increase by 35.02 kg for “every increase” in length instead of a 
1-meter increase in length. Because the response refers to predicted weights and is presented in context, it 
was scored as partially correct. In part (c) the response provides a correct formula for the predicted weight, 
but the value for the predicted weight is incorrect. ‘This is an arithmetic error that does not affect the score. 
A correct formula is presented for computing the actual weight from the residual and the incorrect 
predicted weight. Part (c) was scored as essentially correct. Because one part was scored as essentially 
correct, and two parts were scored as partially correct, the response earned a score of 2. 
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